The Dirichlet-Multinomial Model for Multivariate Randomized Response Data and Small Samples
Abstract
In survey sampling, the randomized response (RR) technique can be used to obtain truthful answers to sensitive questions. Although the RR technique masks the individual answers, individual (sensitive) response rates can be estimated when multivariate response data are observed. The beta-binomial model for binary RR data is generalized to handle multivariate categorical RR data. The Dirichlet-multinomial model for categorical RR data is extended with a linear transformation of the masked individual categorical-response rates to correct for the RR design and to retrieve the sensitive categorical-response rates even for small data samples. This specification of the Dirichlet-multinomial model enables a straightforward empirical Bayes estimation of the model parameters. A constrained-Dirichlet prior is introduced to identify homogeneity restrictions in response rates across persons and/or categories. The performance of the full Bayes parameter estimation method is verified using simulated data. The proposed model is applied to the college alcohol problem scale study, where students were interviewed either directly or via the randomized response technique about negative consequences from drinking.
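To make the linear correction concrete, here is a minimal Python sketch. It assumes a forced-response RR design, which is only one possible design and is not confirmed by the abstract: with probability `p_truth` the respondent answers truthfully, otherwise a category is drawn from a known distribution `forced`. The masked rate for each category is then a linear function of the sensitive rate, and the correction inverts that function. All function names, the flat Dirichlet prior, and the example numbers are hypothetical illustrations, not the authors' implementation.

```python
import random


def mask_rates(pi, p_truth, forced):
    """Masked category rates under the assumed forced-response design."""
    return [p_truth * p + (1.0 - p_truth) * f for p, f in zip(pi, forced)]


def unmask_rates(masked, p_truth, forced):
    """Invert the linear masking to recover the sensitive category rates."""
    return [(m - (1.0 - p_truth) * f) / p_truth for m, f in zip(masked, forced)]


def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet vector by normalizing independent gamma variates."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def posterior_sensitive_rates(counts, prior_alpha, p_truth, forced,
                              n_draws=2000, seed=0):
    """Monte Carlo posterior mean of one person's sensitive rates.

    The masked rates get the conjugate Dirichlet posterior
    Dirichlet(prior_alpha + counts); each draw is mapped through the
    inverse linear transformation. Individual transformed draws can fall
    outside [0, 1]; this sketch does not constrain them.
    """
    rng = random.Random(seed)
    post_alpha = [a + n for a, n in zip(prior_alpha, counts)]
    k = len(counts)
    sums = [0.0] * k
    for _ in range(n_draws):
        masked = sample_dirichlet(post_alpha, rng)
        for c, value in enumerate(unmask_rates(masked, p_truth, forced)):
            sums[c] += value
    return [s / n_draws for s in sums]


# Hypothetical example: 3 categories, 30 masked answers from one respondent.
forced = [1 / 3, 1 / 3, 1 / 3]
estimate = posterior_sensitive_rates([12, 9, 9], [1.0, 1.0, 1.0], 0.8, forced)
print(estimate)
```

Because both the masked rates and `forced` sum to one, the transformed rates also sum to one, so the correction preserves the simplex constraint on the total even when single components are not guaranteed to stay in range.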
Similar resources
The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
Multinomial Dirichlet Gaussian Process Model for Classification of Multidimensional Data
We present a probabilistic multinomial Dirichlet classification model for multidimensional data and Gaussian process priors. Here, we consider an efficient computational method that can be used to obtain the approximate posteriors for latent variables and parameters needed to define the multiclass Gaussian process classification model. We first investigated the process of inducing a posterior...
Modeling Baseline Shifts in Multivariate Disease Outbreak Detection
Methods: Existing multivariate algorithms only model disease-relevant data streams (e.g., anti-fever medication sales or patient visits with constitutional syndrome for detection of flu outbreak). On the contrary, we also incorporate a non-disease-relevant data stream as a control factor. We assume that the counts from all data streams follow a Multinomial distribution. Given this distribution, ...
DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics [version 2; referees: 2 approved]
There are many instances in genomics data analyses where measurements are made on a multivariate response. For example, alternative splicing can lead to multiple expressed isoforms from the same primary transcript. There are situations where differences (e.g. between normal and disease state) in the relative ratio of expressed isoforms may have significant phenotypic consequences or lead to pro...
Some superpopulation models for estimating the number of population uniques
The number of unique individuals in the population is of great importance in evaluating the disclosure risk of a microdata set. We approach this problem by considering some basic superpopulation models including the gamma-Poisson model of Bethlehem et al. (1990). We introduce the Dirichlet-multinomial model, which is closely related to but more basic than the gamma-Poisson model, in the sense that ...
Publication date: 2012